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MODIFIED PROTEIN AND PEPTIDE PHARMACEUTICALS 



Cross -Reference to Related Application 
This application is a continuation-in-part of U.S. 
application 07/771,262 filed 4 October 1991. It is also a 
5 continuation-in-part of U.S. serial.no. 07/532,254 filed 1 June 
1990, which is a continuation-in-part of U.S. application 
07/313,646 filed 21 February 1989. The disclosures of the 
above-mentioned applications are incorporated herein by 
reference . 



1° Acknowledgment of Government Support 

This invention was made with government support under NIH 
Contract No. NOl-HD-9-2922 , awarded by the National Institutes 
of Health. The government has certain rights in this invention. 

Technical Field 

15 The invention relates to the field of administration of 

peptide and protein biologically active pharmaceuticals. More 
particularly, the invention concerns the use of modified 
peptides and proteins which contain extensions representing the 
carboxy terminal peptide of human chorionic gonadotropin or 

20 fragments thereof. 

Background Art 

PCT application WO 90/09800, published 7 September 1990, 
describes a number of modified forms of reproductive hormones. 
As described in the PCT application, any biologically active 

25 protein, such as a hormone, a cytokine, a hormone regulator and 
the like, can be modified so as to improve its clearance 
characteristics by providing it with an extended amino acid 
sequence at its carboxy terminus wherein the extension is the 
carboxy terminal peptide of human chorionic gonadotropin or a 

30 variant thereof. As described in this application, the required 
positions for the carboxy terminal peptide (CTP) are from any 
one of positions 112-118 to position 145 of the j3 subunit of 
human chorionic gonadotropin. As further explained in the PCT 
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application, variants of the CTP extension are obtained by- 
conservative amino acid substitutions such that the capacity of 
the CTP to alter the clearance characteristics is not destroyed. 
Fragments shorter than the sequence extending from positions 
5 112-118 to 145 are not specifically disclosed, nor are 
extensions from other than the C- terminus. 

The results specifically with CTP - extended /? subunit of FSH 
are also described in two papers by Applicants herein: LaPolt, 
P.S. et al.; Endocrinology (1992) 131 :2514-2520 and Fares, F.A. 

10 et al.; Proc Natl Acad Sci USA (1992) 89:4304-4308. Both of 
these papers are incorporated herein by reference. 

It has now been found that the alteration of the clearance 
properties desired can be achieved when the CTP is appended to 
the N-terminus instead of to the C-terminus of the unmodified 

15 peptide or appended to both, or placed in any noncritical 

region of the peptide, and that this effect may also be achieved 
using only portions or fragments of the CTP which must, however, 
contain at least one O-glycosylation site. The desired effects 
are retained when tandem extensions are employed. 

20 Disclosure of the Invention 

The invention provides modified forms of proteins and 
peptides that have biological activity, which modified forms 
have altered clearance properties more desirable than those of 
the unmodified peptide or protein. The modification comprises 

25 providing an additional amino acid sequence in a noncritical 

region of the peptide or protein, including at the N-terminus or 
C-terminus or at both termini wherein the extension comprises at 
least one CTP-derived sequence. The CTP-derived sequence is the 
native amino acid sequence at positions 112-118 to 145 or 

30 variant thereof as herein defined, and must either be only a 

fragment of this sequence or a tandem form of the sequence when * 
only the carboxy terminus of the peptide is thus extended. 

Thus, in one aspect, the invention is directed to a 
modified protein or peptide pharmaceutical wherein the 

35 modification comprises an extension of or insertion into the 

amino acid sequence of the protein or peptide, said insertion or 
extension consisting essentially of at least one sequence 



WO 94/24148 



PCT/US94/04069 



representing the carboxy terminal peptide {CTP) of human 
chorionic gonadotropin 0-subunit or its variant, or a portion 
thereof containing at least one O-glycosylation site. Said 
portion contains at least one amino acid less than the amino 
5 acid sequence 118-145 of said CTP. If only the C-terminus of 
the biologically active peptide or protein is extended, said 
extension must consist essentially of said portion, rather than 
the entire CTP unit. 

In other aspects, the invention is directed to recombinant 
10 materials and methods to produce the modified proteins and 
peptides of the invention and to pharmaceutical compositions 
containing them. 

Brief Description of the Drawings 

Figures 1A and IB show the construction of a human FSHj3 
15 subunit extended by one or two CTP units respectively. Figure 
1C shows the insertion of the extended FSHjS subunit into the 
expression vector pM a . 

Figure 2 is a photocopy of the gel obtained form SDS-PAGE 
conducted on immunoprecipitates of lysates and media of CHO 
20 cells containing expression vectors for native and truncated 
forms of human CGj3. 

Figure 3 shows the details of the fusion of a single CTP 
unit representing positions 115-145 to the carboxy terminus of 
FSHj3 as described in Figure 1A. 
25 Figure 4 shows the positions for insertion of a CTP subunit 

representing positions 118-145 at the amino or carboxy terminus 
of the a subunit . 

Figure 5 diagrams the construction of the a subunit 
extended at the carboxy terminus by CTP (aC) . 
30 Figure 6 diagrams the construction of Cof wherein a CTP unit 

representing positions 118-145 of hCG/3 is inserted between amino 
acids 3 and 4 of the a subunit. 

Figure 7 is a photocopy of SDS-PAGE results obtained from 
labeled lysates and media of CHO cells transfected with 
35 expression vectors for native or subunit and orC. 
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Figure 8 is a graph showing the in vitro biological 
activity of native FSH and FSH containing either an extended /} 
subunit or aC or both. 

Figure 9 is a graph showing in vitro biological activity of 
5 native FSH and FSH containing Car or both. 

Figure 10 shows the amino acid sequence and numbering of 
positions 112-145 of human CGj3 . 

Modes of Carrying Out the Invention 

Human chorionic gonadotropin (hCG) is one of at least four 

10 "reproductive" hormones in a family which also includes follicle 
stimulating hormone (FSH) , luteinizing hormone (LH) , and thyroid 
stimulating hormone (TSH) . All of these hormones are comprised 
of a subunits which, for a given species, are identical in amino 
acid sequence among the group, and /3 subunits which differ 

15 according to the member of the family. The /3 subunit of hCG is 
substantially larger than the other /J subunits in that it 
contains approximately 34 additional amino acids at the C- 
terminus referred to herein as the carboxy terminal portion 
(CTP) which is considered responsible for the comparatively 

20 longer serum half-life of hCG as compared to other gonadotropins 
(Matzuk, M. et al . , Endocrinol (1989) 126:376). In the native 
hormone, this CTP extension contains four mucin-like O-linked 
oligosaccharides . 

As used herein, human alpha subunit, and human FSH, LH, 

25 TSH, and CG beta subunits as well as the heterodimeric forms 

have in general their conventional definitions and refer to the 
proteins having the amino acid sequences known in the art per 
se, or allelic variants thereof, deliberately constructed 
muteins thereof maintaining the activity of the native protein 

30 regardless of the glycosylation pattern exhibited, or mutant 
forms thereof having at least 90% homology, preferably 95% 
homology, with the native forms. 

"Native" forms of these peptides are those which have the 
amino acid sequences isolated from human tissue, and have these 

3 5 known sequences per se, or their allelic variants. 

"Mutein" forms of these proteins are those which have 
deliberate alterations in amino acid sequence produced by, for 
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example, site-specific mutagenesis or by other recombinant 
manipulations, or which are prepared synthetically. These 
alterations result in amino acid sequences wherein the 
biological activity of the subunit is retained and/or wherein 
5 the subunit has at least 90% homology, preferably 95% homology, 
with the native form. 

Although it is recognized that . glycosylation pattern has a 
profound influence on activity both qualitatively and 
quantitatively, for convenience the terms FSH, LH, TSH, and CG 

10 beta subunits refers to the amino acid sequence characteristic 
of the peptides, as does "alpha subunit." When only the beta 
chain is referred to, the terms will be, for example, FSH beta; 
when the heterodimer is referred to, the simple term "FSH" will 
be used. It will be clear from the context in what manner the 

15 glycosylation pattern is affected by, for example, recombinant 
expression host or alteration in the glycosylation sites. Forms 
of the glycoprotein with specified glycosylation patterns will 
be so noted. 

As used herein "peptide" and "protein" are used 
20 interchangeably, since the length distinction between them is 
arbitrary. 

"Noncritical" regions of peptides or proteins are those 
regions of the molecules not required for biological activity. 
In general, these regions are removed from binding sites, 

25 precursor cleavage sites, and catalytic regions. 

As used herein, the "CTP unit" refers to an amino acid 
sequence found at the carboxy terminus of human chorionic 
gonadotropin /3 subunit which extends from amino acid 112-118 to 
residue 145 at the C-terminus or to a portion thereof. Thus, 

30 each "complete" CTP unit contains 28-34 amino acids, depending 
on the N- terminus of the CTP. The native sequence of positions 
112-145 is shown in Figure 10. 

By a "partial" CTP unit is meant an amino acid sequence 
which occurs between positions 112-118 to 145 inclusive, but 

35 which has at least one amino acid deleted from the shortest 

possible "complete" CTP unit (i.e. from positions 118-145) . The 
"partial" CTP units included in the invention must contain at 
least one O-glycosylation site. The CTP unit contains four such 
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sites at the serine residues at positions 121 (site 1) ; 127 
(site 2); 132 (site 3); and 138 (site 4). The partial forms of 
CTP useful in the invention will contain one or more of these 
sites arranged in the order in which they appear in the native 
5 CTP sequence. Thus, the "partial" CTP unit useful in the 

invention may include all four glycosylation sites; sites 1, 2 
and 3; sites 1, 2 and 4; sites 1, 3 and 4; sites 2, 3 and 4; or 
simply sites 1 and 2; 1 and 3; 1 and 4; 2 and 3; 2 and 4; or 3 
and 4; or may contain only one of sites 1, 2, 3 or 4. 

10 By "tandem" extensions is meant that the insert or 

extension contains at least two "CTP units". Each CTP unit may 
be complete or a fragment, and native or a variant. All of the 
CTP units in the tandem extension or insert may be identical, or 
they may be different from each other. Thus, for example, the 

15, tandem extension or insert may generically be partial -complete ; 
partial -partial; partial -complete -partial; complete -complete- 
partial, and the like wherein each of the noted partial or 
complete CTP units may independently be either a variant or the 
native sequence. The nature of variants is further explained 

20 below. 

Variants 

The "CTP unit" may correspond exactly to the native CTP 
sequence, or may be a variant wherein 1-5 of the amino acids 
contained in the sequence is substituted by a conservative 

25 analog of the native amino acid residue at that position, and 

wherein said substitutions taken cumulatively do not result in a 
substantial change in the stability conferring properties of the 
CTP unit. "Conservative analog" means, in the conventional 
sense, an analog wherein the residue substituted is of the same 

30 general amino acid category as that for which substitution is 
made. Amino acids have been classified into such groups, as is 
understood in the art, by, for example, Dayhoff, M. et al., 
Atlas of Protein Sequences and Structure (1972) 5:89-99. In 
general, acidic amino acids fall into one group; basic amino 

35 acids into another; neutral hydrophilic amino acids into 
another; and so forth. 
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More specifically, amino acid residues can be generally 
subclassif ied into four major subclasses as follows: 

Acidic: The residue has a negative charge due to loss of H 
ion at physiological pH and the residue is attracted by aqueous 
5 solution so as to seek the surface positions in the conformation 
of a peptide in which it is contained when the peptide is in 
aqueous medium at physiological pH. . 

Basic: The residue has a positive charge due to 
association with H ion at physiological pH and the residue is 
10 attracted by aqueous solution so as to seek the surface 
positions in the conformation of a peptide in which it is 
contained when the peptide is in aqueous medium at physiological 
pH. 

Neutral /nonpolar: The residues are not charged at 
15 physiological pH and the residue is repelled by aqueous solution 
so as to seek the inner positions in the conformation of a 
peptide in which it is contained when the peptide is in aqueous 
medium. These residues are also designated "hydrophobic" 
herein. 

20 Neutral /polar : The residues are not charged at 

physiological pH, but the residue is attracted by aqueous 
solution so as to seek the outer positions in the conformation 
of a peptide in which it is contained when the peptide is in 
aqueous medium. 

25 It is understood, of course, that in a statistical 

collection of individual residue molecules some molecules will 
be charged, and some not, and there will be an attraction for or 
repulsion from an aqueous medium to a greater or lesser extent. 
To fit the definition of "charged, " a significant percentage (at 

30 least approximately 25%) of the individual molecules are charged 
at physiological pH. The degree of attraction or repulsion 
required for classification as polar or nonpolar is arbitrary 
and, therefore, amino acids specifically contemplated by the 
invention have been classified as one or the other. Most amino 

35 acids not specifically named can be classified on the basis of 
known behavior. 

Amino acid residues can be further subclassif ied as cyclic 
or noncyclic, and aromatic or nonaromatic, self-explanatory 
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classifications with respect to the side chain substituent 
groups of the residues, and as small or large. The residue is 
considered small if it contains a total of 4 carbon atoms or 
less, inclusive of the carboxyl carbon. Small residues are, of 
5 course, always nonaromatic. 

For the naturally occurring protein amino acids, 
subclassif ication according to the foregoing scheme is as 
follows . 

Acidic : Aspartic acid and Glutamic acid; 

10 Basic/noncvclic : Arginine, Lysine; 

Basic/cyclic : Histidine; 

Neutral /polar /small : Glycine, serine, 
cysteine; 

Neutral /nonpolar/small : Alanine; 

15 Neutral /polar /large/nonaromatic : Threonine, Asparagine, 

Glutamine ; 

Neutral /polar /large aromatic : Tyrosine ; 

Neu t r al /nonpo 1 ar / 1 arge /nonaroma t i c : Valine, Isoleucine, 
Leucine , Methionine ; 

20 Neut ral /nonpo 1 ar / 1 arge / aromat i c : Phenylalanine, and 

Tryptophan . 

The gene-encoded secondary amino acid proline, although 
technically within the group neutral/nonpolar/ large/cyclic and 
nonaromatic, is a special case due to its known effects on the 
25 secondary conformation of peptide chains, and is not, therefore, 
included in this defined group. 

If the modified peptides of the invention are constructed 
by modification of the gene, the CTP units will contain only 
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gene encoded amino acid substitutions; however, if the CTP unit 
is synthesized by standard, for example, solid phase, peptide 
synthesis methods and ligated, for example, enzymatically , to 
the C-terminus of the acceptor peptide or protein, non-gene 
5 encoded amino acids, such as aminoisobutyric acid (Aib) , 

phenylglycine (Phg) , and the like can also be substituted for 
their analogous counterparts . 

These non-encoded amino acids also include, for example, 
beta-alanine (beta-Ala) , or other omega-amino acids, such as 3- 

10 amino propionic, 4 -amino butyric and so forth, sarcosine (Sar) , 
ornithine (Orn) , citrulline (Cit) , t-butylalanine (t-BuA) , t- 
butylglycine (t-BuG) , N-methylisoleucine (N-Melle) , and 
cyclohexylalanine (Cha) , norleucine (Nle) , cysteic acid (Cya) 2- 
naphthylalanine (2-Nal) ; l,2,3,4-tetrahydroisoquinoline-3- 

15 carboxylic acid (Tic) ; me r cap t oval eric acid (Mvl) ; 0-2- 

thienylalanine (Thi) ; and methionine sulfoxide (MSO) . These 
also fall conveniently into particular categories. 
Based on the above definitions, 

Sar and beta-Ala and Aib are neutral /nonpolar/ small; 
20 t-BuA, t-BuG, N-Melle, Nle, Mvl and Cha are 

neutral/nonpolar/large/nonaromatic; 
Orn is basic/noncyclic; 
Cya is acidic; 

Cit, Acetyl Lys, and MSO are neutral /polar/ 
25 large/nonaromatic; and 

Phg, Nal, Thi and Tic are neutral/nonpolar/large/ aromatic. 

The various omega-amino acids are classified according to 
size as neutral/nonpolar/small (beta-Ala, i.e., 3- 
aminopropionic, 4-aminobutyric) or large (all others) . 
30 Thus, amino acid substitutions other than those encoded in 

the gene can also be included in peptide compounds within the 
scope of the invention and can be classified within this general 
scheme according to their structure. 

Preferred Embodiments of CTP Units 



35 



The notation used for the CTP units of the invention is as 
follows: for portions of the complete CTP unit, the positions 
included in the portion are designated by their number as they 
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10 



15 



20 



25 



30 



35 



appear in Figure 11 herein. Where substitutions occur, the 

substituted amino acid is provided along with a superscript 

indicating its position. Thus, for example, CTP (120-143) 

represents that portion of CTP extending from positions 120 to 

143; CTP (120-130; 136-143) represents a fused amino acid 

sequence lacking positions 118-119, 131-135, and 144-145 of the 

native sequence. CTP (Arg 122 ) refers to a variant wherein the 

lysine at position 122 is substituted by an arginine; CTP 

(lie 134 ) refers to a variant wherein the leucine at position 134 

128 143 

is substituted by isoleucine. CTP (Val Val x J ) represents a 
variant wherein two substitutions have been made, one for the 
leucine at position 128 and the other for the isoleucine at 
position 142. CTP (120-143; He 128 Ala 130 ) represents the 
relevant portion of the CTP unit where the two indicated 
substitutions have been made. 

Particularly preferred are those CTP units of the following 
formulas : 



#1 


CTP 


(116 


-132) 


#2 


CTP 


(118 


-128; 


#3 


CTP 


(117 


-142) 


#4 


CTP 


(116 


-130) 


#5 


CTP 


(116 


-123; 


#6 


CTP 


(115 


-133; 


#7 


CTP 


(117 


-140, 


#8 


CTP 


(125 


-143, 


#9 


CTP 


(135 


-145, 


#10 


CTP 


(131 


-143, 


#11 


CTP 


(118 


-132) 


#12 


CTP 


(118 


-127) 


#13 


CTP 


(118 


-145) 


#14 


CTP 


(115 


-132) 


#15 


CTP 


(115 


-127) 


#16 


CTP 


(115 


-145) 


#17 


CTP 


(112 


-145) 


#18 


CTP 


(112 


-132) 


#19 


CTP 


(112 


-127) 



.123 
.139> 



Gin 140 ) 



, 142 Cha 143 ) 
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Modified Peptides and Proteins 

Any peptide or protein of biological significance is 
subject to modification according to the invention method. 
Included among such candidates for modification, therefore, are 
5 peptide hormones, such as the four human "reproductive" hormones 
set forth above, including the /3-chains thereof; insulin; human 
growth hormone and growth hormone of other species; enkephalin; 
ACTH; glucagon; and the like. Also useful as subjects for the 
modification of the invention are various growth factors such as 

10 insulin-like growth factors; epidermal growth factors; acidic 
and basic fibroblast growth factors; platelet-derived growth 
factors; the various colony stimulating factors, such as 
granulocyte CSF, macrophage -CSF, and the like; as well as the 
various cytokines such as IL-2, IL-3 and the plethora of 

15 additional interleukin proteins; the various interferons; tumor 
necrosis factor; and the like. Enzymes such as tPA (which has a 
very short half life) , urokinase and thrombin may also be 
modified. Also candidates for the method of the invention are 
short peptide sequences such as luteinizing hormone releasing 

20 hormone (LHRH) ; somatostatin; growth hormone releasing factor 

(GHRF) ; and the endorphins. Additional protein medicaments such 
as alveolar surfactant proteins; natriuretic factors; adhesions; 
receptor peptides; receptor binding ligands in general; 
antibodies and fragments thereof; and any other useful peptide 

25 or protein with a desired biological function can be modified 
according to the methods described herein. 

It should be noted, of course, that the peptide or protein 
into which the CTP unit(s) is (are) inserted or to which the CTP 
unit(s) is (are) added as an extension could also be in modified 

30 form from that ordinarily occurring biologically, as long as 
biological activity is retained. 

A particularly preferred candidate for modification with 
the CTP units of the invention is the a subunit of the 
reproductive hormones. This has the advantage that coupling of 

3 5 the modified a subunit with the corresponding 0 units results in 
an entire set of hormones for which bioactivity can be extended. 
The CTP unit or tandem units may be fused at either or both of 
the carboxy or amino terminus of the a subunit; however, the 
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amino terminus is preferred. Available evidence shows that the 
amino terminus is not involved in assembly with the /3 subunit 
nor is it associated with receptor binding determinants. 

As set forth above, the insertions or extensions of the CTP 
5 unit(s) must be in a region of the peptide or protein that is 
noncritical for the biological activity desired. Thus, regions 
critical for inducing proper folding, binding to receptors, 
catalytic activity and the like should be avoided. Similarly, 
regions which are critical to assure the three-dimensional 

10 conformation of the protein should be avoided. The 

ascertainment of noncritical regions is readily accomplished by 
deleting or modifying candidate regions and conducting an 
appropriate assay for the desired activity. Regions where 
modifications result in loss of activity are critical; regions 

15 wherein the alteration results in the same or similar activity 
are considered noncritical. 

In many cases, the locations of critical regions are 
known. For example, for the a-subunit, position 33-59 are 
thought to be necessary for signal transduction and the 20 amino 

20 acid stretch at the carboxy terminus is needed for signal 

transduct ion/receptor binding. Residues critical for assembly 
with the /J-subunit include at least residues 33-58, particularly 
37-40. 

If the approaching or peptide candidate has not been 
25 studied adequately, as set forth above, noncritical regions may 
be ascertained by deleting or modifying candidate regions in 
conducting an appropriate assay for the desired activity. An 
appropriate starting point for proteins in general is at the NRC 
terminus; however, in some particular cases, these starting 
30 points will not necessarily yield successful results. In 

addition to the N-terminus and C-terminus per se, insertion in 
regions close to these termini also represents reasonable 
starting points. In order for the insertion of the CTP unit or 
portion thereof, some small number of amino acids may be deleted 
35 from the noncritical region. Preferred embodiments of the 
modified proteins and peptides of the invention include the 
following: 

CGa (1-3) -CTP#1- (4-92) 



WO 94/24148 



PCT/US94/04069 



- 13 - 

CGa (1-3) -CTP#11- (4-92) 

CGa (1-3) -CTP#12- (4-92) 

CGa (1-92) -CTP#11 

CGa (1-92) -CTP#12 
5 CGa (l-3)-CTP#17 (4-92) 

CGa (l-3)-CTP#18 (4-92) 

CGa (l-3)-CTP#19 (4-92) 

FSH/3 (1-111) -CTP#11 

FSH/J (1-111) -CTP#12 
10 FSH/J (1-111) -CTP#11-CTP#12 

proinsulin (1-3) -CTP#5-CTP#6 (4-end) 

FSHj3 (1-111) -CTP#13-CTP#13 

TSH/J (1-110) -CTP#11 

TSH/J (1-110) -CTP#12 
15 LH/3 (1-114) -CTP#14 

LHj3 (1-114) -CTP#15 

LH/3 (1-114) -CTP#16 

LHjS (1-121) -CTP#11 

LH/3 (1-121) -CTP#12 
20 LHjS (1-121) -CTP#13 

tPA (l-2)-CTP#14 (3-end) 

tPA (l-2)-CTP#15 (3-end) 

tPA (l-2)-CTP#16 (3-end) 

hGH (l-4)-CTP#ll (5-end) 
25 hGH (l-4)-CTP#12 (5-end) 

hGH (l-4)-CTP#13 (5-end) 

IL-3 (1-end) -CTP#14 

IL-3 (1-end) -CTP#15 

IL-3 (1-end) -CTP#16 



30 Coupled Forms 

The modified peptides and proteins of the invention may be 
further conjugated or derivatized in ways generally understood 
to derivatize amino acid sequences, such as phosphorylation, 
glycosylation, deglycosylation of ordinarily glycosylated forms, 

35 modification of the amino acid side chains (e.g., conversion of 
proline to hydroxyproline) and similar modifications analogous 
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to those post-translational events which have been found to 
occur generally. 

As is generally known in the art, the modified peptides and 
proteins of the invention may be coupled to labels, drugs, 
5 targeting agents, carriers, solid supports, and the like, 

depending on the desired application. The labeled forms of the 
modified biologicals may be used to track their metabolic fate; 
suitable labels for this purpose include, especially, 
radioisotope labels such as iodine 131, technetium 99, indium 

10 111, and the like. The labels may also be used to mediate 

detection of the modified proteins or peptides in assay systems; 
in this instance, radioisotopes may also be used as well as 
enzyme labels, fluorescent labels, chromogenic labels, and the 
like. The use of such labels is particularly helpful if the 

15 peptide or protein is itself a targeting agent such as an 
antibody or a receptor ligand. 

Conversely, if the modified peptide or protein is a 
targeting ligand, primarily, and is relatively free of 
metabolism-altering activity, the modified compound of the 

20 invention may be conjugated to an appropriate drug, such as an 
antiinflammatory drug, an antibiotic, a toxin, and the like. 
The modified compounds of the invention may also be coupled to 
carriers to enhance their immunogenicity in the preparation of 
antibodies specifically immunoreactive with these new modified 

25 forms. Suitable carriers for this purpose include keyhole 
limpet hemocyanin (KLH) , bovine serum albumin (BSA) and 
diphtheria toxoid, and the like. Standard coupling techniques 
for linking the modified peptides of the invention to carriers, 
including the use of bifunctional linkers, can be employed. 

30 Similar linking techniques, along with others, may be 

employed to couple the modified peptides and proteins of the 
invention to solid supports. When coupled, these modified 
peptides and proteins can then be used as affinity reagents for 
the separation of desired components with which specific 

35 reaction is exhibited. 
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Preparation Methods 

Methods to construct the modified peptide and protein 
biologically active compounds of the invention are well known in 
the art. As set forth above, if only gene encoded amino acids 
5 are included, the most practical approach at present is to 

synthesize these materials recombinantly by modification of the 
DNA encoding the desired peptide. Techniques for site-directed 
mutagenesis, ligation of additional sequences, and construction 
of suitable expression systems are all, by now, well known in 

10 the art. The DNA encoding the CTP unit (s) to be added to the 

DNA encoding the desired peptide or protein is most conveniently 
constructed synthetically using standard solid phase techniques, 
preferably to include restriction sites for ease of ligation, 
and coupled to the sequence encoding the candidate peptide or 

15 protein. If the DNA encoding the candidate peptide or protein 
is not already a part of an expression system containing 
suitable control elements for transcription and translation of 
the included coding sequence, the modified DNA coding sequences 
are provided with these features. As is well known, expression 

20 systems are now available compatible with a wide variety of 
hosts, including procaryotic hosts such as bacteria and 
eucaryotic hosts such as yeast, plant cells, insect cells, 
mammalian cells, avian cells, and the like. 

When the unmodified peptide is a reproductive hormone 

25 including the a-subunit, whether the a-subunit is modified or 
unmodified, recombinant production of the appropriate a-subunit 
is preferably effected using a "minigene" construction. 

As used herein, the alpha subunit 11 minigene" refers to the 
gene construction disclosed in Matzuk, M.M., et al, Mol 

30 Endocrinol (1988) 2x 95-100, in the description of the 

2 2 
construction of pM /CG alpha or pM /alpha. This "minigene" is 

characterized by retention only of the intron sequence between 

exon III and exon IV, all upstream introns having been deleted. 

In the particular construction described, the N-terminal coding 

35 sequences which are derived from exon II and a portion of 

exon III are supplied from cDNA and are ligated directly through 

an Xbal restriction site into the coding sequence of exon III so 

that the introns between exons I and II and between exons II and 
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III are absent. However, the intron between exons III and IV as 
well as the signals 3' of the coding sequence are retained. The 
resulting minigene can conveniently be inserted as a BamHI/Bglll 
segment. Other means for construction of a comparable minigene 
5 are, of course, possible and the definition is not restricted to 
the particular construction wherein the coding sequences are 
ligated through an Xbal site. However, this is a convenient 
means for the construction of the gene, and there is no 
particular advantage to other approaches, such as synthetic or 

10 partially synthetic preparation of the gene. The definition 
includes those coding sequences for the alpha subunit which 
retain the intron between exons III and IV, or any other intron 
and preferably no other introns. 

For recombinant production, transfected host cells using 

15 expression systems are used and cultured to produce the desired 
protein. These terms are used herein as follows: 

A "transfected" recombinant host cell, i.e., a cell 
"transfected" with the recombinant expression systems of the 
invention, refers to a host cell which has been altered to 

20 contain this expression system by any convenient manner of 

introducing it, including transf ection, viral infection, and so 
forth. "Transfected" refers to cells containing this expression 
system whether the system is integrated into the chromosome or 
is extrachromosomal . The "transfected" cells may either be 

25 stable with respect to inclusion of the expression system or 
not. In short, "transfected" recombinant host cells with the 
expression system of the invention refers to cells which include 
this expression system as a result of their manipulation to 
include it, when they natively do not, regardless of the manner 

30 of effecting this incorporation. 

"Expression system" refers to a DNA sequence which includes 
a coding sequence to be expressed and those accompanying control 
DNA sequences necessary to effect the expression of the coding 
sequence. Typically, these controls include a promoter, 

35 termination regulating sequences, and, in some cases, an 

operator or other mechanism to regulate expression. The control 
sequences are those which are designed to be functional in a 
particular target recombinant host cell and therefore the host 
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cell must be chosen so as to be compatible with the control 
sequences in the constructed expression system. 

As used herein "cells," "cell cultures," and "cell lines" 
are used interchangeably without particular attention to nuances 
5 of meaning. Where the distinction between them is important, it 
will be clear from the context. Where any can be meant, all are 
intended to be included. 

The protein produced may be recovered from the lysate of 
the cells if produced intracellularly, or from the medium if 
10 secreted. Techniques for recovering recombinant proteins from 
cell cultures are well understood in the art, and these proteins 
can be purified using known techniques such as chromatography, 
gel electrophoresis, selective precipitation, and the like. 
Alternatively, if the candidate biological is a short 
15 peptide or if enzymatic transfer of the subunit can be effected, 
the CTP unit(s) of the invention may be synthesized directly 
using in vitro solid phase peptide synthesis techniques and 
under these conditions, if desired, the CTP subunit may be 
modified by analogous amino acids which are not gene encoded. 

20 Antibodies 

The modified peptides and proteins of the invention may be 
used to generate antibodies specifically immunoreactive with 
these new compounds. These antibodies are useful in a variety 
of diagnostic and therapeutic applications, depending on the 
25 nature of the biological activity of the unmodified peptide or 
protein. 

The antibodies are generally prepared using standard 
immunization protocols in mammals such as rabbits, mice, sheep 
or rats, and the antibodies are titered as polyclonal antisera 

3 0 to assure adequate immunization. The polyclonal antisera can 

then be harvested as such for use in, for example, immunoassays. 
Antibody- secreting cells from the host, such as spleen cells, or 
peripheral blood leukocytes, may be immortalized using known 
techniques and screened for production of monoclonal antibodies 

35 immunospecif ic with the modified peptides of the invention. 

By "immunospecif ic for the modified peptides" is meant 
antibodies which are immunoreactive with the CTP-unit-modif ied 
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forms of the peptides or proteins, but not with the unmodified 
portions within the general parameters considered to determine 
affinity or nonaffinity. It is understood that specificity is a 
relative term, and an arbitrary limit could be chosen, such as a 
5 difference in immunoreactivity of 100- fold or greater. Thus, an 
immunospecif ic antibody included within the invention is at 
least 100 times more reactive with the modified protein or 
peptide than with its unmodified form. 

Formulation 

10 The modified peptides or proteins of the invention are 

formulated and administered using methods comparable to those 
known for the unmodified peptide or protein corresponding to the 
modified form. Thus, formulation and administration methods 
will vary according to the candidate unmodified form. However, 

15 the dosage level and frequency of administration may be reduced 
as compared to the unmodified form in view of the extended 
biological half life of the modified peptide or protein. 

Formulations for the peptides and proteins modified 
according to the invention are those typical of protein or 

20 peptide drugs such as found in Remington's Pharmaceutical 

Sciences , latest edition, Mack Publishing Company, Easton, PA. 
Generally, proteins or peptides are administered by injection, 
typically intravenous, intramuscular, subcutaneous, or 
intraperitoneal injection, or using formulations for 

25 transmucosal or transdermal delivery. These formulations 

generally include a detergent or penetrant such as bile salts, 
fusidic acids, and the like. These formulations can be 
administered as aerosols or suppositories or, in the case of 
transdermal administration, in the form of skin patches. 

30 Oral administration is also possible provided the 

formulation protects the peptides of the invention from 
degradation in the digestive system. 

Optimization of dosage regimen and formulation is conducted 
as a routine matter and as generally performed in the art. 

35 The following examples are intended to illustrate but not 

to limit the invention. 
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Example 1 

Preparation of Human 6 Subunit 

with Two CTP Unit Tandem Extensions 

Figures 1A and IB show the construction of an expression 

5 vector wherein the /3-chain of human FSH is modified to include 

two CTP units. As shown in Figure IB, the Hindlll site at the 

3 '-terminus of the human FSH-/3 subunit, extended by one CTP unit 

is used to couple the CTP unit from the 3' terminus of the human 

HCG-/3 gene to obtain the extended & subunit. The hFSH-0 (CTP) 

2 

10 gene is then ligated into the expression vector pM to obtain an 
expression system capable of producing the extended form of the 
FSH-/3 chain in mammalian cells. The construction of the host 
expression vectors is described by Matzuk, M.M. et al., Proc 
Natl Acad Sci USA (1987) 84:6354-6358; Matzuk, M.M. et al . , J 

15 Cell Biol (1988) 106 ; 1049-1059 . 

In more detail, to create hFSH/3 chimera bearing a single 
unit of the O-linked terminal region of hCG/3 subunit hFSH/3 
(CTP) ) a Hindlll site was created in the stop codon of hFSH/3 
gene at codon 111 and in the hCG/3 gene at codon 118 (Figure 1A) . 

20 The Hindi 1 1 -Hindi 1 1 fragment from the hFSH/3 gene was ligated in 
frame to the CG/3 BamHI-Hindlll fragment. This chimera 

Tift lift 

(hFSH0(CTP)) contained a ser to Ala x change at the ligation 

point, which was corrected by oligonucleotide-directed 

mutagenesis. The chimera containing two tandem CTP repeats 

25 (hFSH/3 (CTP) 2 was constructed by creating a new Hindlll site in 

the stop codon of the hFSH/3 (CTP) chimera (Figure IB). The 

Hindi I I -Hindi I I fragment was ligated to the BamHI-Hindlll 

fragment from hCG/J. The generated ala codon can be reconverted 

to a serine codon as described above. 

30 To insert hFSH/3 (CTP) or hFSH/3 (CTP) 0 genes into the 

2 

eucaryotic expression vector pM , the Hindlll sites at the 5' 
ends were converted to BamHI sites using Klenow and BamHI 
oligonucleotide linker (Figure 1C) , and the BamHI-BamHI 
fragments containing the hFSH/3CTP or hFSH/3 (CTP) 0 genes were 
35 inserted into the BamHI site in pM . The correct orientation 
was confirmed by restriction enzyme analysis and the entire 
sequence of exon III was sequenced to confirm the specificity of 
the mutagenesis. 
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Example 2 
Effect of CTP Tandem Extensions 
The human FSH containing the j3 subunit extended by two CTP 
units prepared as set forth in Example 1 above was injected into 
5 rats. 24 Sprague-Dawley female rats were used in the study. 12 
rats were each injected with 10 IU unmodified FSH formulated in 
MEM medium; 12 rats were injected with 10 IU FSH comprising 
hFSHj3(CTP) 2 formulated in MEM medium. Serum was withdrawn 
immediately and several times during the first hour, and then 
10 after 2, 4 and 8 hours. The serum was assayed using standard 
radioimmunoassay techniques for FSH hormone. The results 

showed that while the amount of unmodified FSH in the serum 
declined from about 0.5 IU/ml to less than 0.05 IU/ml over an 8 
hour period, the modified FSH of the invention containing two 
15 CTP units remains substantially unchanged over this time period 
declining from about 0.8 IU/ml to about 0.5 IU/ml. 

Example 3 

Construction of CTP Units that 

Represent "Partial" Units 

20 The gene encoding human chorionic gonadotropin j3 subunit is 

2 

inserted into the plasmid pM (supra) at the BamHI site 
downstream of the long terminal repeat (LTR) . A BamHI/Bglll) 
fragment is subcloned into M13 for site-directed mutagenesis. 
Stop codons are thus provided in place of the arginine residue 
25 at position 133 or in place of the leucine residue at position 
128. The mutated fragment is then reinserted into the pM host 
vector. 

The vectors encoding truncated forms of human CG0 were then 

transfected into CHO cells and the transfected cells cultured 

35 

30 and labeled for 7 hours with S-cysteine. Lysates and media 
were prepared and immunoprecipitated with CG)3 antiserum. The 
precipitates were then subjected to SDS polyacrylamide gel 
electrophoresis with results shown in Figure 2. In Figure 2, 
the lysate and media lanes are labeled according to the carboxy 

35 terminus. It is well established that most of the serine-O- 
linked glycosylation is attached just prior to secretion. 
Threrefore there is a mobility shift reflecting the increase in 
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molecular weight of the subunit when it is secreted. As seen 
here, there is a progressive decrease in molecular weight shift 
from lysate to medium in the shortened forms of hCG/3. The data 
show lower molecular weights for the secreted forms of the 
5 truncated subunits, thus indicating the absence of serine 
glycosylation sites. 

Example 4 

Construction of Modified FSHfl And Other Peptides 
The CTP unit representing the "complete" form or the 

10 "portion" form is then ligated to FSHj3 by cleaving both FSH/J and 
the mutated or unmutated hCG/3 gene with Hindlll and ligating the 
resultant upstream portion of FSH with the CTP unit, as shown in 
Figure 1A. Religation results in a ser-to-ala substitution at 
position 118 of the CTP unit. If desired, this can be 

15 reconverted to a codon for serine, as shown in Figure 3. 

Similarly, the C-terminal extended form of human 
proinsulin, growth hormone, and a-subunit are prepared from the 
CTP unit in complete or portion forms. 

The CTP unit can be inserted either at the amino or carboxy 

20 terminus of the a subunit, as shown in Figure 4. As there 
illustrated, positions 118-145 of hCGjS are inserted. 

In more detail, to construct chimeras containing the CTP 
unit "complete" or as a "portion" at the carboxy terminus of the 
a-subunit (orC) , the BamHI-Bgllll and Hindlll-BamHI fragments 

25 containing the alpha subunit and hCG/J genes, respectively, are 
subjected to the polymerase chain reaction (PCR) (Horton, R.M. 
et al., Gene (1989) 77:51-59). This is diagramed in Figure 5. 

The GH fragment shown, which was transcribed from the a 
minigene (author) ( J Cell Biol (1988) 106 : 1049-1059) contains 

30 the entire translated sequence of the o? subunit, and the I J 

fragment containing the CTP sequence from the CG/3 subunits, were 
amplified in separate PCR reactions. The 3' end of primer H 
used to amplify the GH fragment is complementary to the 5' end 
of the CTP sequence. Similarly, the 5' end of primer I was 

35 complementary to the 3' end of a sequence. Thus, by using two 
internal primers that overlap, the GH and I J fragments share 
"sticky ends". These fragments were purified and mixed together 
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and used as a template for PCR. The overlapping ends allow one 
strand from each fragment to act as a primer on the other and 
adding G and I primers allows the amplificaton of the a-CTP 
chimera gene. The a-CTP gene was inserted into the BamHI site 
5 of pM 2 . 

The sequences of the primers used in the construction of aC 
are as follows: (5' — >3') 

H = GCCTTTGAGGAAGAGGAAGATTTGTGATAATAAC 
I = GTTATTATGACAAATCTTCCTCTTCCTCAAAGGC 
10 G = GTAAAACGACGGCCAGT 

J = AACAGCTATGACCATG 

Oligonucleotides G and J are for priming M13 mp 19 (New England 
Biolabs 1992 catalogue #1201 and 1211) . Oligonucleotides G and 
J prime downstream and upstream, respectively of the polylinker 
15 region. 

The CTP was also inserted into the N-terminal end of the a 
subunit at the amino acid 3/4 junction (Ca; Fig. 5) in a manner 
similar to that used for constructing the aC. CTP was inserted 
in an internal region of the subunit between amino acid 3 and 4 
20 (Fig. 6) . 

The sequences of the oligonucleotide primers used in the 
construction of Ca are: 

(A) 5 f -AGC GGA TAA CAA TTT CAC ACA GGA- 3 ' 

(E) 5' -CAT TCC GCT CCT GAT TCC TCT TCC TCA AAG-3 ' 
25 (B) 3'-GTA AGG CGA GGA CTA AGG AGA AGG AGT TTC-5' 

(C) 5'-CCG ATC CTC CCA CAA GTG CAG GAT TGC CCA-3' 

(F) 3'-GGC TAG GAG GGT GTT CAC GTC CTA ACG GGT-5 ' 

(D) 3' -ATT CTT GGA GTT CTA GGG GTC TTC GAAA-5 ' 
Oligonucleotides A and D are used to prime M13 . 

30 The 3 -end of primer B used to amplify the AB 

fragment provides a sequence complementary to the 5' -end of the 
CTP region. Similarly, the 5' -end of primer C, used to amplify 
the cd fragment, is complementary to the 3' -end of exon 2 in the 
of-subunit gene. The fragments were gel purified and mixed for 

35 use as a PCR template. The overlapping ends permit one strand 
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from each fragment to act as a primer for the other; adding a 
and d primers allows amplification of the entire chimera. 

Correct orientation was confirmed by- 
restriction enzyme analysis, and the resulting modified gene was 
5 sequenced to verify correct substitutions. 

Example 5 
Production of Modified Hormones 
The gene encoding the modified a- subunit (aC) 
was inserted into the BamHI site of pM and transfected alone or 

10 together with the CG/3 gene into Chinese hamster ovary cells. 

Continuous labeling of stable clones with S 35 cysteine shows that 
the secreted form of the modified peptide migrates more slowly 
than the corresponding unmodified form. In Figure 7, lanes 1 
and 3 represent the lysates of cells producing unmodified a- 

15 subunit and aC, respectively; lines 2 and 4 are the 

corresponding supernatants . Since the secreted form of modified 
aC has a much greater change in mobility with respect to the 
lysate form as compared to wild type a, it can be concluded that 
secreted aC contains O- linked oligosaccharides contributed by 

20 the CTP extension. 

Co-transfection of the FSH/J subunit with 
either aC or Ca in CHO cells resulted in dimers that were 
secreted with an efficiency comparable to native FSH. 

Example 6 

25 Biological Activity 

Native FSH and FSH dimers with aC or Ca were 
quantitated in conditioned medium with a FSH immunoradiometric 
assay and a double -antibody RIA (Diagnostic Products, Los 
Angeles) . The in vitro bioactivity of these hormones was 

30 determined by the granulosa cell aromatase bioassay as described 
(author J Biol Chem (1989) 264:4764-4774; author Endocrinology 
(1986) 119:1570-1577) . Stimulation of estrogen production by 
native FSH and by otC and Ca FSH was compared after a 3 -day 
culture period. 

35 Figure 8 compares the steroidogenic response 

of native FSH, (closed circles) aC-FSH/3 (open squares) , and FSH 
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0-CTP subunit/o? (open triangles) . It is seen that the presence 
of CTP on the carboxy end of the a subunit results in a lower 
biologic response (25-50 fold reduction) . 

In contrast, the presence of the CTP on the 
5 amino end of the a subunit (Ca) did not affect receptor binding 
or steroidogenesis (Fig. 9) of FSH dimers. Native FSH (open 
circles) , FSH/? extended at the carboxy terminus with a CTP unit 
dimerized with native a (open triangles) , FSH/3 subunit dimerized 
with Cot (closed circles) , and CTP-extended /JFSH dimerized with 

10 Car (closed triangles) all show similar activities. The modified 
forms at optimum concentration show somewhat higher activities 
than the native form. Preliminary results show Ca dimers will 
provide long acting agonists with biological activity comparable 
to the native hormone. 

15 Further results showed that modified proteins 

including partial CTP sequences particularly those lacking O- 
linked sites elicit a biologic response comparable to dimers 
that include full length CTP and are expected to be less 
antigenic. 
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Claims 

1. A modified protein or peptide 
pharmaceutical wherein said modification comprises insertion of 
a "complete" or "partial" CTP unit or variant thereof into a 

5 noncritical region of said protein or peptide, with the proviso 
that if said noncritical region is the C-terminus, said insert 
is of a partial CTP unit. 

2. The modified peptide or protein of claim 
1 wherein said CTP unit contains all four glycosylation sites 

10 (sites 1, 2, 3 and 4) found in native CTP unit; or 

wherein the glycosylation sites in said CTP 
unit consist of sites 1, 2 and 3; or sites 1, 3 and 4; or sites 
1, 2 and 4; or sites 2, 3 and 4; or 

wherein the glycosylation sites in said CTP 
15 unit consist of sites 1 and 2; or sites 1 and 3; or sites 1 and 
4; or sites 2 and 3; or sites 2 and 4; or sites 3 and 4; or 

wherein the glycosylation sites in said CTP 
unit consist of site 1, or site 2, or site 3, or site 4. 

3. The modified peptide or protein of claim 
20 1 wherein said partial CTP unit consists of positions 112-132; 

115-132; 116-132; or 118-132; or 112-127; 115-127; 116-127; or 
118-127. 

4. The modified peptide or protein of claim 
1 wherein said noncritical region is proximal to the C-terminus; 

25 or 

wherein said noncritical region is proximal to 

the N- terminus. 

5. The modified peptide or protein of claim 
1 which includes at least two CTP units. 



30 



6. The modified peptide or protein of claim 
5 wherein said at least two CTP units are different; or 
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wherein said at least two CTP units are 



identical . 

7. The modified peptide or protein of claim 
1 wherein said peptide or protein is ar-subunit of human 
5 reproductive enzymes and wherein said noncritical region is 
proximal to the N- terminus. 



1 wherein said peptide or protein is a cytokine. 

9. The modified peptide or protein of claim 
10 8 wherein said peptide or protein is IL-3. 



8. 



The modified peptide or protein of claim 



10. The modified peptide or protein of claim 
1 wherein said peptide or protein is proinsulin; or 

wherein said peptide or protein is human FSH/3; 



or 



15 



wherein said peptide or protein is human TSHjS; 



or 



wherein said peptide or protein is human LH/J. 



11. A pharmaceutical composition comprising 
the modified protein or peptide of claim 1 in admixture with a 
20 pharmaceutical^ acceptable excipient. 
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